curious case
The Curious Case of the Bizarre, Disappearing Captcha
While puzzling captchas--from dogs in hats to sliding jockstraps--still exist, most bot-deterring challenges have vanished into the background. As I browse the web in 2025, I rarely encounter captchas anymore. There's no slanted text to discern. No image grid of stoplights to identify. And on the rare occasion that I am asked to complete some bot-deterring task, the experience almost always feels surreal.
- North America > United States > Tennessee (0.04)
- North America > United States > California (0.04)
- Europe > Slovakia (0.04)
- Europe > Czechia (0.04)
- Information Technology > Security & Privacy (1.00)
- Leisure & Entertainment (0.98)
The curious case of the test set AUROC
Roberts, Michael, Hazan, Alon, Dittmer, Sören, Rudd, James H. F., Schönlieb, Carola-Bibiane
Test Set Whilst the size and complexity of ML models have rapidly and significantly increased over the past decade, the methods for assessing their performance have not kept pace. In particular, among the many potential performance metrics, the ML community stubbornly continues to use (a) the area under the receiver operating characteristic curve (AUROC) for a validation and test cohort (distinct from training data) or (b) the sensitivity and specificity for the test data at an optimal threshold determined from the validation ROC. Example validation and test set model output distributions, ROC curves coloured by threshold. We don't seek to discuss the individual The key strength of the ROC curve is its when evaluated on datasets from different shortcomings of the AUROC (e.g. Therefore, it is possible to extrapolation required for'degenerate' distributions) different thresholds, we gain great insight obtain consistently good AUROC values for However, a validation and test cohort of data whilst is a staple for ML researchers.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.15)
- Europe > Germany > Bremen > Bremen (0.14)
Detective McDavitt and the Curious Case of the Clown Wedgefish
How do you find an elusive animal that most people have never even seen dead in a fish market? Matthew McDavitt, above, knows how.Melody Robbins This story was originally published by Hakai Magazine and is reproduced here as part of the Climate Desk collaboration. Peter Kyne sits down at his desk to write a eulogy for a fish he's never met. No scientist has seen signs of the critically endangered Rhynchobatus cooki, or clown wedgefish, since a dead one turned up at a fish market in 1996. Kyne, a conservation biologist at Charles Darwin University in Australia who studies wedgefish, has worked only with preserved specimens of the spotted sea creature. "This thing's dust," Kyne thinks, feeling defeated as he writes the somber news in a draft assessment of the global conservation status of wedgefish species for the International Union for Conservation of Nature. Wedgefish are a type of ray.
The Curious Case of the Missing Google Assistant
Google executives hosted the company's I/O developer conference this week, an annual ritual that has in recent years centered on artificial intelligence. With OpenAI's ChatGPT and Microsoft's Bing chatbot seen as challenging Google's search domination, Google CEO Sundar Pichai seemed intent on projecting the message that his company is still the leader in AI--and is speeding up deployment of the technology. Google's own chatty large language model, Bard, was the headliner, and it is now publicly available in 180 countries. Following along behind came about a dozen generative AI product features and experiments that can do things like help programmers write code, draft emails, or generate speaker notes for Google Slides presentations. But hardly a word was said about Google Assistant, the clunkily named and voice-centric AI assistant that was the company's previous AI champion.
The Curious Case of Absolute Position Embeddings
Sinha, Koustuv, Kazemnejad, Amirhossein, Reddy, Siva, Pineau, Joelle, Hupkes, Dieuwke, Williams, Adina
Transformer language models encode the notion of word order using positional information. Most commonly, this positional information is represented by absolute position embeddings (APEs), that are learned from the pretraining data. However, in natural language, it is not absolute position that matters, but relative position, and the extent to which APEs can capture this type of information has not been investigated. In this work, we observe that models trained with APE over-rely on positional information to the point that they break-down when subjected to sentences with shifted position information. Specifically, when models are subjected to sentences starting from a non-zero position (excluding the effect of priming), they exhibit noticeably degraded performance on zero to full-shot tasks, across a range of model families and model sizes. Our findings raise questions about the efficacy of APEs to model the relativity of position information, and invite further introspection on the sentence and word order processing strategies employed by these models.
The Curious Case of How MS-excel Was a Nightmare for Bioinformatics
Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. How do we have no power in front of Excel's habit of arbitrarily changing the format to what it does like?
The curious case of developmental BERTology: On sparsity, transfer learning, generalization and the brain
In this essay, we explore a point of intersection between deep learning and neuroscience, through the lens of large language models, transfer learning and network compression. Just like perceptual and cognitive neurophysiology has inspired effective deep neural network architectures which in turn make a useful model for understanding the brain, here we explore how biological neural development might inspire efficient and robust optimization procedures which in turn serve as a useful model for the maturation and aging of the brain. Hopefully it would inspire the reader in one way or two, or at the very least, kill some boredom during a global pandemic. We are going to touch on the following topics through the lens of large language models: - How do overparameterized deep neural nets generalize? - How does transfer learning help generalization? Before we start, it is prudent to say a few words about the brain metaphor, to clarify this author's position on the issue as it often arises central at debates. The confluence of deep learning and neuroscience arguably took place as early as the conception of artificial neural nets, because artificial neurons abstract characteristic behaviors of biological ones (McCulloch and Pitts, 1943).
- North America > United States (0.14)
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
- Research Report (0.64)
- Overview (0.46)
The Curious Case of Data Annotation and AI - RTInsights
And for in-house teams, labeling data can be the proverbial bottleneck, limiting a company's ability to quickly train and validate machine learning models. By its very definition, artificial intelligence refers to computer systems that can learn, reason, and act for themselves, but where does this intelligence come from? For decades, the collaborative intelligence of humans and machines has produced some of the world's leading technologies. And while there's nothing glamorous about the data being used to train today's AI applications, the role of data annotation in AI is nonetheless fascinating. Imagine reviewing hours of video footage – sorting through thousands of driving scenes, to label all of the vehicles that come into frame, and you've got data annotation.
Curious Case of Actuarial Science, Geocoding, and Machine Learning - DZone AI
This article illustrates how Geocoding uncovers the untapped value within generally overlooked insurance categories, such as Life and Annuity, and how it can help address modern-day business challenges remarked by Orszag. While Geocoding in Big Data is gaining prominence within Property and Casualty (P&C), we believe the real opportunity lies in the actuarial adoption of AI framework capable of processing consumable inputs that weren't visible in the erstwhile "Ease of Geocoding" era. Establishing this premise for Life and Annuity, we then pivot towards crafting a general purpose Geo-inclusive architecture that can help actuaries of all disciplines apply Machine Learning to solve new generation of business problems, such as, dwindling subscribers or risk-attributed challenges, such as, Adverse Selection. Nearly all of the data in the insurance business has a location attribute, e.g. However, many insurance companies have not fully utilized this component besides billing and mailing purposes.